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Abstract Equine coronavirus has been responsible for 
several outbreaks of disease in the United States and Japan. 
Only one complete genome sequence (NC99 isolated in the 
US) had been reported for this pathogenic RNA virus. 
Here, we report the complete genome sequences of three 
equine coronaviruses isolated in 2009 and 2012 in Japan. 
The genome sequences of Tokachi09, Obihirol2-1 and 
Obihiro12-2 were 30,782, 30,916 and 30,916 nucleotides in 
length, respectively, excluding the 3’-poly (A) tails. All 
three isolates were genetically similar to NC99 
(98.2—98.7 %), but deletions and insertions were observed 
in the genes nsp3 of ORFla, NS2 and p4.7. 


Equine coronavirus (ECoV) is an enveloped, positive- 
stranded RNA _ virus’ belonging to the — species 
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Betacoronavirus I in the genus Betacoronavirus. Several 
outbreaks of ECoV, characterized by clinical symptoms of 
fever, anorexia, lethargy, leucopenia, and digestive prob- 
lems, have been reported in the United States [1] and Japan 
[2, 3]. These clinical signs were reproduced in our exper- 
imental challenge study [4]. However, to the best of our 
knowledge, only four ECoV strains have been isolated in 
the United States and Japan, and the complete genome 
sequence of this pathogen is only available for one 
American strain (NC99) [5]. To improve the molecular 
diagnosis of ECoV and to better understand its epidemi- 
ology, more genomic sequence data are required. There- 
fore, in this study, we determined the complete genome 
sequences of the remaining three ECoV strains isolated in 
Japan and compared them with that of NC99. 

Three ECoVs, Tokachi09, isolated in 2009 [2], Obihi- 
rol2-1 and Obihirol2-2, isolated in 2012 [3], were ana- 
lyzed in this study. These ECoVs were isolated from adult 
horses showing symptoms of diarrhea at Obihiro Race- 
course, Hokkaido, Japan. Tokachi09, Obihirol2-1 and 
Obihiro12-2 were passaged 4 to 5 times in HRT-18G cells 
[2, 3]. Viral RNA was extracted using a MagNA Pure LC 
Total Nucleic Acid Isolation Kit (Roche Diagnostics, 
Mannheim, Germany). RT-PCR was performed with six 
primer sets designed based on the complete genome 
sequence of NC99 (GenBank accession number: 
NC_010327) (Supplementary Table 1). RT-PCR was per- 
formed using a PrimeScript H High Fidelity RT-PCR Kit or 
PrimeScript II High Fidelity One Step RT-PCR Kit 
(Takara, Shiga, Japan). 

These PCR amplicons were sequenced using Ion Torrent 
technology (Thermo Fisher Scientific, MA, USA). Librar- 
les were constructed using an Ion Xpress Plus Fragment 
Library Kit and an Ion Xpress Barcode Adapters Kit. 
Emulsion PCR, enrichment and loading onto an Ion 318 
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Chip were performed automatically using Ion Chef, and 
sequencing was conducted in the Ion PGM system 
according to the manufacturer’s instructions. As a result, 
we obtained more than 140 million nucleotides (nt) for 
each ECoV, and the average depth was 14859.2 times. The 
raw signal data were analyzed using Torrent Suite version 
4.4.1, and NC99 was used as a reference sequence. The 
reads with more than 100 bases that could be mapped to the 
reference sequence were used. Torrent Variant Caller 
version 4.2 was used to call the mutated sites, and the most 
frequently observed alleles were selected in each of the 
called positions. Bases within the reference sequence were 
substituted with the observed alleles if frequencies excee- 
ded 50 %. 

The 5’- and 3’-end sequences were determined by Rapid 
Amplification of cDNA Ends (RACE) using a GeneRacer 
Kit with SuperScript III reverse transcriptase (Thermo 
Fisher Scientific) and a 3’-Full RACE Core Set (Takara), 
respectively. The primers used for 5’/3’ RACE are shown 
in Supplementary Table 1. PCR was conducted using 
HotStarTaq Plus DNA Polymerase (QIAGEN, Hilden, 
Germany). The amplified products were cloned using a 
TOPO TA Cloning Kit for Sequencing (Thermo Fisher 
Scientific), and the plasmids were sequenced commercially 
(Fasmac, Atsugi, Japan) using an M13 universal primer. At 
least three clones were sequenced per amplified product. 
Sequences were analyzed and assembled using the Vector 
NTI Advance 11 software (Thermo Fisher Scientific). 
Phylogenetic analysis of the nucleic acid sequences was 
conducted using MEGA 5.2 software [6]. Phylogenetic 
trees based on complete genome sequences were con- 
structed using the neighbor-joining method. Statistical 
analysis of the trees was conducted using the bootstrap test 
(1,000 replicates). 

The complete genome sequences of Tokachi09, Obihi- 
ro12-1 and Obihiro12-2 were 30,782, 30,916 and 30,916 nt 
in length, respectively, excluding the 3’-poly (A) tails. The 
accession numbers registered in the GenBank/EMBL/ 
DDBJ databases for these genome sequences are as fol- 
lows: Tokachi09 (LC061272), Obihirol2-1 (LC061273) 
and Obihirol2-2 (LC061274). The complete genome of 
NC99 shared 98.2 % nt sequence identity with Tokachi09, 
98.7 % with Obihirol2-1, and 98.7 % with Obihirol2-2. 
Phylogenetic analysis (Fig. 1) showed that ECoVs grouped 
separately from other members of the species Betacoron- 
avirus I, and Obihirol2-1 and Obihirol2-2, isolated during 
the same outbreak, were closely related. 

Putative genes of Tokachi09, Obihirol2-1 and Obihi- 
rol2-2 are shown in Table 1, and were compared with the 
genes published previously for NC99 [5]. The 5’ untrans- 
lated region (UTR) spanned nt | to 208 in the three Japa- 
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nese ECoVs, and the 3’-UTR spanned nt 30,494 to 30,782 
in Tokachi09 and nt 30,628 to 30,916 in Obthirol2-1 and 
Obihirol2-2. All three Japanese isolates exhibited a 1-nt 
deletion (nt 1) compared with isolate NC99. The 5’- and 
3’-UTRs were highly conserved between NC99 and the 
three Japanese isolates (5’-UTR, 98.6%; 3’-UTR, 
97.6-99.3 %). 

ORF la was located at nt 209-13,474 in TokachiO9 and 
nt 209-13,465 in Obihirol2-1 and Obihirol2-2. The 
sequence encoding the nsp3 protein within ORFla con- 
tained a 9-nt insertion (nt 3,175—3,183) in Tokachi09. In 
addition, a 33-nt deletion compared with NC99 was 
observed between 6,361 and 6,362 in Tokachi09, and 
between 6,352 and 6,353 in Obihirol2-1 and Obihirol2-2. 
Other betacoronaviruses, such as bovine coronavirus, por- 
cine hemagglutinating encephalomyelitis virus, and human 
coronavirus OC43, harbor longer deletions in the same 
region, indicating that this region may be prone to deletion 
mutations. Due to these deletions and insertions, ORFla in 
Tokachi09, Obihirol2-1 and Obihirol2-2 was 24, 33 and 
33 nt shorter than in NC99. Overall, ORFla and ORF lab 
were highly conserved between NC99 and the three Japa- 
nese ECoVs (ORF 1a, 98.4-99.0 % for nt and 97.8-98.9 % 
for amino acid [aa]; ORFlab, 98.9-99.1 % for nt and 
98.6—99.1 % for aa). 

The NS2 nt and aa sequences were identical in length 
between isolates Tokachi09 and NC99 and were highly 
conserved in sequence (98.6 % for nt and 97.8 % for aa). 
In contrast, Obihirol2-1 and Obihirol2-2 contained a 2-nt 
deletion (between 22,159 and 22,160) compared with 
NC99 and Tokachi09. This deletion created an additional 
stop codon, truncating the NS2 open reading frame by 84 
aa in these isolates compared with NC99 and Tokachi09. 
This deletion also created a start codon downstream of this 
stop codon, and another open reading frame (nt 
22,186—-22,410) may therefore exist in Obihirol2-1 and 
Obihiro12-2. 

Between genes p4.7 and p1l2.7, we reconfirmed that 
TokachiO9, Obihirol2-1 and Obihirol2-2 harbored dele- 
tions totaling 185, 40 and 40 nt, respectively, compared 
with NC99, as reported previously [2, 3]. These deletions 
would drastically shorten the length of the putative p4.7 
gene. However, it is yet to be confirmed that the p4.7 
gene is expressed and functional in these three Japanese 
ECoVs. 

The nt lengths of the HE, S, p12.7, E, M, N and I genes 
were identical between NC99 and the three Japanese iso- 
lates, and the nt and aa sequences of these genes were 
highly conserved: HE, 97.5—98.0 % for nt and 96.5-97.9 % 
for aa; S, 98.5-99.0 % for nt and 98.5—99.0 % for aa; 
pl2.7, 99.1% for nt and 99.1-100 % for aa; E, 
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Bovine coronavirus: Mebus (U00735) 


Bovine coronavirus: Kakegawa (AB354579) 

Canine respiratory coronavirus: K37 (JX860640) 

Dromedary camel coronavirus HKU23: HKU23-265F (KF906249) 

Human coronavirus OC43: ATCC VR-759 (AY585228) 

Porcine hemagglutinating encephalomyelitis virus: VW572 (DQ011855) 
Equine coronavirus: NC99 (NC_010327) 
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Fig. 1 Phylogenetic analysis of the complete genome sequences of 
members of the species Betacoronavirus 1. The three equine 
coronavirus strains examined in this study are indicated by closed 
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circles. The percent bootstrap support is indicated by the value at each 
node, and values less than 70 % are omitted 


Table 1 Comparison of putative genes in NC99 with those in Tokachi09, Obihiro12-1 and Obihirol 2-2 


Gene NC99* Tokachi09 
Genome No. of No. of Genome 
position nucleotides amino acids position 

5’ UTR 1-209 209 1-208 

ORF la 210-13,499 13,290 4429 209-13,474 

ORF lab 210—21,595 21,386 7128 209-21,570 

NS2 21,610—22,446 837 278 21,585-22,421 

HE 22,458—23,729 L272 423 22,433-23,704 

S 23,744-27,835 4,092 1363 23,719-27,810 

p4.7 27,825—27,947 123 40 27,800-27,841 

p12.7 28,076—28,405 330 109 27,866-28,195 

E 28,392—28,646 255 84 28,182-28,436 

M 28,661—29,353 693 230 28,45 1-29,143 

N 29,363-30,703 1,341 446 29,153-30,493 

I 29,424—30,044 621 206 29,214-29,834 

3’ UTR 30,704—30,992 289 30,494-30,782 


“ Sequence data of NC99 are from Zhang et al [5] 


98.4-99.6 % for nt and 98.8-100 % for aa; M, 
98.7-99.3 % for nt and 99.1—100 % for aa; N, 98.0-98.2 % 
for nt and 97.3—97.5 % for aa; I, 97.4—-98.1 % for nt and 
95.1-95.6 % for aa. 

In conclusion, three ECoV isolates from Japan (Toka- 
chi09, Obihirol2-1 and Obihirol2-2) are genetically sim- 
uar to NC99, isolated in the United States, with the 
exception of the nsp3, NS2 and p4.7 genetic regions. It will 
now be interesting to investigate whether these mutations 
alter the antigenicity and pathogenicity of ECoV. 
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Obihiro12-1 and Obihiro12-2 


No. of No. of Genome No. of No. of 
nucleotides amino acids position nucleotides amino acids 
208 1-208 208 
13,266 4,421 209-13,465 13,257 4,418 
21,362 7,120 209-21,561 21,353 7,117 
837 278 21,576-22,160 585 194 
L272 423 22,422-23,693 1,272 423 
4,092 1,363 23,708-27,799 4,092 1,363 
42 13 27,789-27,812 24 7 
330 109 28,000-28,329 330 109 
259 84 28,316-28,570 25) 84 
693 230 28,585-29,277 693 230 
1,341 446 29,287-30,627 1,341 446 
621 206 29,348-29,968 621 206 
289 30,628-30,916 289 
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